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SUMMARY 

Probabilities  of  Excessive  Deviations  of 
Simple  Linear  Rank  Statistics 

Let  {S^}  be  a sequence  of  r.v.'s  whose  asymptotic  distribution 
Is  N(0.  0^)  and  let  {x^}  be  a sequence  of  constants  with  ''s  A 
(rlghthand)  excessive  deviation  la  an  event  of  the  form  {Sj^  > 

The  asymptotic  normality  of  Sjj  tells  us  P{Sj^  > 0,  N > 

but  not  the  rate  of  this  convergence.  These  rates  are  needed  for 
the  evaluation  of  Bahadur  (Xj^  ■ xi^)  and  Bayes  Risk  (Xjj  - x/Toy,  N) 
efficiencies.  When  S„  Is  a k-sample  linear  rank  statistic  (see 
Hajek  and  ^Idak  (1967)),  and  x^  ■ o(N),  we  show 
log  P(Sj^  > '''  “ assuming  the  null  hypothesis.  Tn  the 

tvo-eample  case,  we  establish  also  that 

P{Sjj  > ■ o{exp[-x^/2  + J(Xjjf  when  NXj^  (0  <•  '■  i)  is 

the  size  of  the  first  sample  and  J is  a function  whose  behavior  Is 
analyzed  for  various  and  For  example.  If  x^  ••  o(N),  then 
J(Xjj,  Ajj)  • o(l)  as  N 


II 


Introduction 


Simple  linear  rank  statlatlce  arise  in  a variety  of  situations, 
particularly  in  the  problan  of  casting  the  equality  of  two  or  more 
distributions  by  non>paranetric  tests.  While  the  asymptotic 
distributions  of  simple  linear  rank  statistics  have  been  studied 

V 

extensively  (aeci  e.g.,  Hkjck  and  Sldik  (1967)),  Investigations 
concerning  their  large  deviation  properties  have  been  made  only 
recently  (Stone  (1967,  1968,  1969)  and  Woodworth  (1970)).  Tn  this 
paper  we  study  the  rates  of  convergence  to  eero  of  null  probabili- 
ties of  excessive  deviations  of  k-sample  (k  % 2)  simple  linear 
rank  statistics.  We  begin  in  this  section  with  dlecusslons  of  the 
notions  of  an  excessive  deviation  and  of  a simple  linear  rank  sta- 
tistic. The  rceulte  ere  in  Section  2 and  the  proofs  are  in 
Section  3. 

The  concept  of  an  excessive  deviation  of  a random  variable  will 
be  discussed  here  in  a general  setting.  Let  (Sj^,  N > 1 ) be  a 
sequence  of  random  variables  with  positive  finite  varl.Tnces  and 
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let  {Ojj,  N - 1)  be  a sequence  of  poiiltlve  constmi'  ; snrii  t h.u 

(1.1)  ■^p|P{Sj,SX0jj)  - ♦(x)|  0,  N ->  •, 

where  $(•)  la  the  atandard  normal  distribution  function. 

Let  {Xj^i  N ^ 1}  be  a aequence  of  nonnegative  real  numbers. 

We  call  probability  of  a (rlght-hnnd) 

deviation  of  alae  of  S^.  A trivial  conaequence  of  (1.1)  Is 

(1.2)  Pjj(x^)  ^ ♦(-Xj,),  N - 

provided  Xj^  ■ 0(1).  However,  when  Xjj  «,  (1.2)  la  clearly  no 
longer  a direct  consequence  of  (1.1),  which  leads  one  t»J  auk: 
given  {S^}  and  {xj,},  where  x^^  •>  « at  none  apeclfled  rate,  does 
(1.2)  hold?  If  not,  what  exactly  la  the  aaymptotlc  behavior  of 
P|^(x^)7  To  answer  theae  queatlona,  tools  more  refined  than  tiie 
central  limit  theorem  are  needed. 

Wo  will  now  Introduce  aone  teminolorty,  due  .<*  Huhln  and 
Sethuraman  (1965a).  If  ■ 0(l),  the  event  f«j^>x.jaj,^}  in  called  nn 
ordinary  dayiation  of  while,  if  x^  «».  it  la  an  excesalvc 


deviation  of  Sy.  ^fo  special  cases  of  excessive  deviations  have 


••parat*  nameai  becauaa  of  eha  appllMtleBa  they  have  found  in  atat- 

iatlcB.  The  caaa  wbara  x^/i.og  N c , 0 < c < » Is  a moderate 

deviation  of  S^i  which  arises  in  the  study  of  Bayes  risk  efflclencv 

(see  Rubin  and  Sethuraman  (1965b)  and  Cllckner  (1972)).  The  moat 

extoiiBlvely  studied  excessive  deviation  la  the  large  devl.iMou.  in 
2 2 

which  Xjj/N  ■»  c . This  attention  .irlMes  from  the  fa«-t  th.it 
probabilities  of  large  deviations  must  be  evaluated  In  order  to 
compute  Bahadur  efficiencies  (sea  Bahadur  (I960)). 


Much  of  the  previous  work  on  excessive  deviations  has  been  for 
sums  of  Independsnt  random  variables.  Whsn  Is  the  mean  of  l.l.d. 
random  variables  with  asro  mean,  unit  variance  and  finite  moment 
generating  function,  and  Ojj  - n”*""*,  Cramer  (1938)  has  shown 

(1.3)  *C-Xj^)exp[x®N"’*  X (Xj,jN~**)] , 

where  " 0(M).  and  where  X(s)  is  a function  which  admits 

a convergent  power  series  expansion  for  small  |z|.  A corollary 
of  (1.3)  is  that  (1.2)  holds  if  x^  • o(N). 


I 


Fell#r  wnerallznd  Cramdr's  (1938)  renult  to  non- 

identioally  distrll)Uted  randon  varinbloo.  Ttubln  and  Rethuranan 
(1965a)  considered  only  moderate  deviationH  but  were  able  to  relax 
the  requirement  of  a moment  fleneretinr  function.  Other  vork  on  the 
problem  has  been  done  by  Cliernoff  (195-0,  Llnnik  (1960,  1961,  1962), 
and  others. 

Only  right-hand  deviations  {S^  > have  been  discussed 

here.  This  practice  elll  be  continued  throughout  this  paper. 

All  results  for  right-hand  ssceeslve  deviations  have  Immediate 
extensions  to  left-hand  deviations  dnd  two-siued 

deviations  (|Sj^|  ^ 

We  will  now  define  a simple  linear  rank  statlatic.  Let 

Xjj^  , ...,  ...,  be  a sequence  of  indepenilmit 

1 k 

and  contlnuouFi  random  variables,  where  k ^ 2,  1,  i ■ 1, 

....  k and  n^^  + . . . + ■ N.  Let  R^j,  ...,  be  the  ranks 

for  the  combined  sample  ...,  X^  . Consider  the  problem  of 

testing  the  null  hypothesis  H:  ...,  are  Identically 

distributed  versus  the  k-sample  alternative  with  density 


k "l 

(1.4)  q(x,i ) ■ n n f(x..  - A.), 


where  f Is  a known  density  and  A^,  ...,  are  known  constants. 


A loi'.'illy  iiiorti  (Miworful  test  of  H vorsuH  <1  Is  ImhocI  oh  ilu'  k s;i'ij  Ic 
slmpltf  J Inaar  rank  statistic  (see  H&jek  and  ^Idfik  (1967)  p.  69). 

k "1 

(1.5)  Sj,  - 

where  •••>  ^ sequence  of  conscantSi  called  scores. 

These  simple  linear  rank  statistics  Include  many  of  the  more 
common  and  Important  rank  statistics.  For  example,  If 
Sj^d)  • 1/N+l,  1"1,  ...,  H,we  obtain  a k**sample  extension  of  the 
classical  Wllcoxon  rank~sum  statistic.  When  " 1 for 

i > (N  + l)/2  and  -1  for  1 s (N  + l>/2,  Sjj  becomes  a median  test.  With 

Bjjd)  - EC*"^(U^^^)],  l-l,  N,  where  < ...  < are 
the  order  statistics  for  a sample  of  size  N fros.  the  uniform 
(0,1)  distribution,  Is  the  Flsher-Yates-Terry-Hoef fdlng  normal 
scores  statistic* 

To  the  authors'  knowledge,  all  previous  work  on  excessive 
deviations  of  linear  rank  statistics  has  been  for  the  case 
of  large  deviations  only  and  under  the  null  hypothesis  H (Stone 
(1967,  1968,  1969)  and  Woodworth  (1970)).  In  this  paper  we  will 
be  considering  only  excessive  deviations  that  arc  not  large 
(l.e.  " but  x*/N  -*■  0)  and  only  the  null  hypothesis  H.  Thus 

there  is  no  overlap  of  the  present  paper  with  Stone's  or 
Woodworth's  work.  In  fact,  our  results  fill  In  a gap  between 
their  work  and  the  work  of  these  many  authors  who  have  studied 
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Che  esynptoclc  nomallCy  of  alnple  linear  rank  aCttClacles  (aee 
tUkjek  and  (1967)  for  rafarancas. 


2.  Main  Reaulta. 
Wa  need  aotne 


■ ^Nl^l 

(2.1) 


notation.  Let 

N1 

+ and 


n^/N.  1*1 I ...f  k.  Let 


Wa  ahall  aiaune 


(2.2)  0 < llalnf  ^ llaaup  < 1, 

N"***  N"*** 

and  o^ji  > 0 for  all  N > k.  Further*  let 

Lat  ba  aa  daflnad  In  (1.5).  Throughout  tha  raat  of  thla  papar 
va  will  aaaume  that  the  null  hypothesis  H holds.  Then  ES^  ■ 0 
and  var  Sjj  • n2o2jj/(N-1)  . Wa  define  to  ba 


t 


30  WO  define  P„ (x„)  to  be 
N N 

(2.3)  **^(’‘1})  ■ ^ 

We  can  now  state  our  main  results. 

j 

i 

' I Theorem  2.1,  i,«t  S,.  be  of  the  form  (1.5).  assume  the  null 

■ - - ■ N 

hypothesis  H obtains  and  define  ^^y  (2.3),  where  “ 

2 

and  x^/N  •*'  0.  Assume 

f 

! ! (2.4)  xj  max  *J(j)  - 0(N) 

IsljSN  “ 

and 

N 

[ (2.5)  yJ  ■ I |aj,(J)l^/N  - o(nVxj,). 

? 1 
I ' 

i . Then,  as  N «>, 

(2.6)  log  Pjj(Xjj)  ^ - xJ/2. 

This  crude  estimate  of  P„(x„)  can  be  Improved  somewhat  In 

N N 

! the  two-samplo  case  (k  ■ 2),  provided  (2.4)  and  (2.5)  are 

strengthened  a little.  When  k • 2,  without  loss  of  generality, 
we  may  set  ■ 1,  " 0,  which  gives  (with  n^n^^), 

s»-  j ‘s'"!!*- 

j-1 

Letting  and  we  get  and  ho 

(2.3)  becomes 

I, 


i 
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(2.8)  Pj.(x^)  - ?{S^  i 

We  now  state  our  additional  results  for  the  two-sample  case. 

Observe  that  (2.9)  Includes  both  (2.4)  and  (2.5). 

Theorem  2.2.  Let  the  null  hypothesis  H obtain,  let  be  of  the 

2 

form  (2.7),  and  define  Pj^(Xjj)  by  (2.8),  where  Xjj  »,  0. 

If 

(2.9)  “ o(N^/x^), 
then,  as  N ->  «, 

(2.10)  P^(5«n^  “ o(expC-x^/2  + J(Xjj,ljj)]>, 
where 

(2.11)  J(Xj^.Xjj)  - (xj/2)(l  - X^Vp^qj,)  + NKXjj.Pjj), 

wliere 

(2.12)  I(X,p)  » Xlog(X/p)  + Alog(X/q), 

where  q ■ 1“P,  qu'"^”PN’  unique  solution  of  the  equation 

N t,  .1 

(2.13)  n - (1  + (qjj/pj^)expC-Xj^a^(f)(XjjTj^/N)^/Pjjqj^,]}“  . 

Theorems  2.1  and  2,2  are  the  best  results  obtainable  with  pre- 


sently  available  methods.  However,  it  would  be  desirable'  to  Improve 

on  these  results  to  obtain  a better  estimate  of  P^(x^),  one 

comparable  to  Cramer's  (1938)  result  (1*3) . To  do  this  in  the 

two-sample  case  a certain  rate  of  convergence  to  normality  of  the 

normed  distribution  4„(*)  of  the  mean  of  certain  dependent  random 

N 

variables  arising  In  the  proof  of  Theorem  2.2  Is  required.  This 
convergence  rate  has  not  been  established.  Let 

(2.14)  - sup  |*jj(x)  - 4(x)|, 

where  i«  defined  in  (3.39).  It  follows  from  Hkjek  (1964) 

that  Ajj  -*■  0 as  N -►  «>.  His  result  Is  reproduced  here  as  Lemma  3.7. 
The  required  convergence  rate  Is  ■ o(l)  where  -*■  » (See 
(3.46)).  If  it  can  be  verified  that  this  rate  holds  then  the 
following  estimate  Is  obtainable: 

(2.15)  'K-Xjj)expCJ(Xj^,Aj^)],  N 

To  extend  Theorem  2.2  to  the  case  k > 2,  a multivariate  gen- 
erallzatlon  of  H&Jek's  (196A)  asymptotic  normality  rrsult  Is 
required.  Unfortunately,  It  too,  le  not  yet  available. 

The  conditions  of  Theorems  2.1  and  2.2  are  stronger  than 
those  required  to  establish  the  asymptotic  normality  of  simple 

V 

linear  rank  statistics  (See,  e.g.,  H8jek  and  Sldfik  (1967) 
pp.  193-195).  This  is  to  be  expected  since  these  theorems  give 
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moro  ilc’taJJpd  an<l  precise  :*nfonitatlon  about  the  asyniptot ic  behav- 
ior of  than  does  a statement  of  asymptotic  normality.  Even  an, 
it  la  natural  to  ask  whether  these  stronger  conditions  are  very 
restrictive!  in  terms  of  potential  applications  Many  simple 
linear  rank  statistics  have  scorea  of  either  of  the  forms 


(2.16)  a^(i)  - E 


or 


(2.17)  aj^,(l)  - ^(i/N+l), 


where  Is  a non-constant  function  on  (0,1)  and,  further,  S. 


N 


ns3nmptotlcally  normal  If  / ^^(u)du  < •»  (See,  e.g.,  Hijek  and 

0 


Sld4k  (1967)  Chapter  V).  Clearly,  If 
1 , 

(2.18)  / |<^(u)fdu  < ~, 

0 


la 


then  (2.9)  holds  for  the  scores  a^^  and  a^^  and  we  may  apply 
Theorems  2.1  and  2.2  to  Some  simple  linear  rank  statistics 

that  have  scores  of  the  form  (2.16)  or  (2.17)  where  ^ satisfies 

(2.18)  are:  median,  Hllcoxon,  Van  dcr  Waerden  and  Flsher-Yatee- 

Terry-Hoeffdlng,  all  tests  for  location  shift,  and  CapoB,  Klotz, 
Ansarl-Bradley,  quartlle,  and  Savage,  all  tests  for  a shift  In 
scale.  See  Cllckner  (1972)  for  details. 


-li- 


lt l8  not  inaiedliitely  obvious  from  Theorem  2.2  ao  precentedt 

exactly  how  faet  tends  to  zero  when  grows  at  a specified 

2 6 3 

rate,  say  • log  W,  or  perhaps  ■ M . It  is  necessary  to 
analyze  J(Xj^,X^|)  for  various  x^^  and  to  see  more  clearly  the 
behavior  of  Pj^(x^j).  This  is  done  in  Corollaries  2.5  and  2.6 
following  the  preliminaiy  Lemmas  2.3  and  2.4. 

Lepune  2.3.  Let  be  the  solution  of  equation  (2.13).  Let  (2.9) 
hold.  Then 

(2.19)  Pp  - * J 

K*1 

where  (c.  (M,  h l)  is  a sequence  of  functions  of  A whose  first 
two  elemonto  are 

(2.20)  e^(X)  »*s-X, 

(2.21)  CgO^)  = (X-»j)(l  ••  1/(2X  A)). 

Further,  if  A^^  « M + 0(11"^),  then  »*a+  0(r"^). 

Lemma  2.4.  Define  l(A,p)  by  (2.12)  for  0 < A < 1 and  0 < p < 1. 

If  1a-p|  < min  (p,q).  then 

(2.22)  l(A,p)  ■ (2pq-Xp*’+Xq^)  + I / ^Vi'^’ 

i.3  ^ q (-p)‘ 
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Coroll>ry  2.S.  In  Th*or«m  2.2  assume  ■ ij  + 0(l/N) 
J(Xj^,Xjj)  ■ o(l). 

Corollegy  t.6.  Under  the  conditions  of  Tlieoren  2,2, 

(2.2«  ■'<*».*»>■  T • 

vhere  td^(X),  l ^ O)  is  a seouence  of  l^inetions  of  X 

element  is 

(2.24)  dQ(x)  - -(1-8X)2/8X  X. 

Further,  if,  i‘or  some  integer  k « 1,  S, 

(2.25)  lininf(xJj^’‘‘^^V^)  > 0 and 
then 

(2.26)  = -^(1),  if  k » 0 

k-1  x^  1 

- I di(Xj,)(-fJ)  + o(l),  if  k > 0. 
i"0 


then 


irhose  first 


u 


i 


I 


1 1 


I.  Proof  R. 


We  begin  the  proofs  with  tome  prellmlnarloA  and  four  lemmas 
which  will  be  used  In  proving  both  Theorems  2.1  and  2.2. 


To  simplify  notation,  we  will  often  write  a^  for  «j,(l),  for 
^Nl*  ®A  •upptessing  the  dependence  on  N. 

Let  < ...  < be  the  order  statistics  for  the  combined 
sample  • • • i lot 


(3.1)  qj  - 


1 if  is  from  the  1-th  sample, 
0 otherwise. 


for  1 *1,  ...,  H,  J *1,  K.  Clearly , ...  ^ ^IN  * ^i ^ 

i * 1,  ...,  k.  Then,  by  rearranging  the  sum. 


(3.2)  Sjj  = I 1 ^l^il* 
J«1  ^ !■! 


Let  f - {(1,  0 0),  (0,  1,  0,  ...,  0),  ...,  (0,  ...,  0,  1)} 


be  a set  consisting  of  k points  with  z " (Sj^,  tj^)  a typical 


element  and  let  ...,  be  e sequence  of  independent  and 


identlcallj'  distributed  random  vectors  taking  values  In  y with 
probabilities  p^,  . . . , Pj^,  > 0,  1 - 1,  . . . , k,  p^  + . . . Pj^  - 1, 
that  is. 


(3.3)  - z)  - n p/^  . 


1-1 


I 


i 


I 
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Let  Z(N)  - + ...  + Z^*^^  - t2j(N),  ....  2j^(N)].  Define  a 

statistic  Tjj  l>y 

N k 

(3.4)  T “ I ti  X . 

^ J-1  1-1 

and  let  t^  be  a realization  of  T^.  The  ^'O  these  proofs  is  the 
observation  that 

(3.5)  P(Sj,  ■ tj^}  - P(Tjj  r tjjlZ(N)  « n}, 


where  n - (n^,  ,,,,  n^^),  and,  further,  (3.5)  la  an  Identity  In  the 
probabilities  p^,  Pj^. 

For  each  h > 0,  define  a new  .iolnt  distribution  for  the 
random  vectors  by 


(3.6)  j-1, 


N) 


N 

n 

J-1 


k 

n 

1-1 


where 


(3.7) 


‘IJ 


P^e 


hajd^ 


k 

I Pi 

i-1  ‘ 


Observe  that,  under  Q,  Z^^\  ....  Z^^^  aie  independent  random 
vectors  taking  values  in  Y,  but  are  not  identically  distributed. 
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Leniati  i . i . Let  S,.  be  ji;iven  by  (3.  '),  'i'  by  mul  Pv(>'m)  1>.V 


(2.3).  Por  any  h > 0 and  any  pj^,  •••*  Pji  Pj  > 0,  1 « 1,  . . . , k, 
pj^  + ...  + pj^  ■ 1 , we  have 


IJ  K 1V...A, 


(3.8)  - p{2(in-VT;  >1^1' 


where 


(3.9)  Aj,  » 0 Q{Tn  ^ t|^,lZ(N)  n n), 


and  Ejj  denotes  8tir,£.ation  ever  thoae  t^j  '■utiDAyl;,^':  tj,,  ^ 
N 


Proof.  From  (3.6)  and  (3.7), 


(3.10)  Pj^(xj^) 


[P(Z(N)^n}]"^  E , Z(N)«n) 


N k ha.fi. 

11  I I P.e  h 
■1=1  i»l 


P(Z(N)«n) 


I o QfVt  Z(fJ)^n}. 
Xj^  i.  w 


which  is  equal  to  the  rlnht  hand  side  of  (3.8).  I'  't  ^ 3.1  in  no'. 


proved. 


Observe  that  (3.8)  Is  an  Identity  in  the  k+1  arbitrary 


variables  p^,  ....  p^,  and  h.  We  will  later  exploit  this  fact  by 
making  convenient  choices  for  these  quantities.  But  first  we 


will  obtain  asymptotic  approximations  to  all  the  factors  In  the 


tgbt-hand  side  of  (3.8)  except  A^j.  We  begin  with  P{Z(N)  » n' 
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Lenma  1.2.  For  any  p^,  Pj^  *uch  that  > 0,  i * i,  k 
Pj_  •••  + Pjj  “ 1»  we  have,  under  (2.2),  as  N 

Ic 

(3.11)  P{Z(N)-n)  -v  n xI^exp[-N  J X.  log(X,/p,)]. 

i-1  ^ 1»1  ^ ‘ ^ 

Proof.  Apply  Stirling's  formula. 

Uana  3.3.  i,et  + ...  + and  “ 

. . . , k-1 ) , where 

»»  ■ ’ij'i-’ij > 

" If  1 1“- 

A..uma  lira  [ f " f (..y)  f.  . poiitlv.  definite  netrlx. 

lfr*«  J«1  ^ 

we  have,  uniformly  in  n, 

(3.12)  (2irH)^’*"^^-'®(det  r)Q{Z(H)«n)  - exp(Jj  0,  F - 

where  C*j  - (N”*^(nj^-Nqj^  J , . . . •»“**<  . ) ) . 

itoot.  Ut  Ej,  . (N“’^[Z^(N)-Nqj^J..ee,ll“**lZj^_^(N)-Nq^_3^J). 
Clearly  is  asymptotically  norsMl  with  saro  mean  vector  and 

y 

covariance  matrix  Rwacava  (1954)  has  proven  a local  limit 
theorem  for  sums  of  i.lede  lattice  vectors.  To  prove  (3.12) 
we  follow  her  argument  In  outline,  varying  the  details  to 
handle  our  non- Identically  distributed  vectors  and  taking  ad- 
vantage of  the  apeclal  structure  of  Z(N).  See  Clickner  (197.') 
for  the  details. 


und 


1. 


Thou 


-17- 


We  no.  need  bom  notation.  Let  vnr^,  etc.,  denoti 
expectation,  variance,  etc.  under  Q.  Then 


(3.13) 


and 


)( 

(3.1A) 


1-1 


Also,  define  -f  , . . + pj^Aj^  and 


(3.15)  tJ  - - v,)2. 


The  notlvatlon  for  the  bound  on  h in  Laoma  3.4  will  be  made  clear 
In  (3.24)  and  (3.25). 


L 

Lemma  3.4.  Let  h ■ where  hj^  i KXjj/N  , for  some  K,  0 < K < ». 
Assume  Xj^  •+  *,  ■*■  0,  and  (2.4)  obtalne.  Then,  as  N •>  “, 

N k h»a.A,  90 

(3.16)  I loR(  ^ p.e^  J ^)  - Nli^T‘/2  + 0(Nh^Yp. 

j-1  1-1  ^ 

(3.17)  q^.  - p^  + »JPihJc(A^  - v^)2  - tJ]  + O(hJyJ).  i-1 k. 

(3.18)  EqTj^  - Nhj^rJ  + O(NhjYj). 

Vj  - N I 4jp^(l-Pi)  + O(Nh^Yj). 

1-1 


(3.19) 
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(3.20)  I q..(l-q..)/N  - Pj(l“P|)  + 
j«l 

(3.21)  + 0(hj)  , 1-t. 


Proof.  Conalder  tho  l«i't-h«nd  aid*  of  (3.16) i 


J^P^a  ^ - 1 + + ’^Vj 


where  e i 1.  Then 


^ ^ 999  99 


- + OChjJj.j  I 


- N1iJtJ/2  + O(NIi^yJ). 


Expreaalona  (3.17)  - (3.21)  follow  almllarly.  The  proof  of  Lemma 
3.4  Is  complete. 
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I'toof  Theorem  2. 1 

In  addition  to  Leinmai  3il-3.4,  we  need  two  Icnanae  epticif- 
U'ally  for  Theorem  2.1. 

Lemma  3.5.  Define  by  (3.9).  Then  for  each  N and  any  h > 0, 

(3.22)  0 2 log  Ajj  + hJtjjO^N** 

a -4Vjh  + logd  - Ep((Tjj  - 2V,  “ *^0^i/*)2|Z(N)ib1/4V2). 

Proof.  Clearly  s exp [-hXj^o^N**] . On  the  other  hand, 

N i axp[-hx^o^H**  - 4V.ph]Q{x„o^N‘*  S T^,  « x^o^N^  + 4V.rl2(N)-n) 
a expl-hx^o^N**  - 4Vjh]a  - Eq[(Tj,-Xj,o^N^  - 2V^)2 |z(N)-n] /4vJ;} , 

by  Chebyahev's  inequality.  LemiBa  3.5  felleira. 

We  will  now  chooae  valuea  for  p^,  ...,  Pj^  *nd  h for  the 
k-eample  caec.  Let 

(3.23)  Pi  ■»  1 - 1.  ....  k 

and  let  h ■ h„  bo  the  iMiaita  r olublc  i o.' 

N 

(3.24)  B^Tj,  - XjjO^n'  + 2Vj. 


1 I iniiiinaiiiiiBMiiMailniin 
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This  choice  of  maximizes  a term  In  the  lower  bound  of  In 
Lemma  3.5.  A almple  argueflMnt  thowa  that  ii  well  defined  by 

(3.24),  end,  further 

(3.25)  h„ 


Lerrtns  3,f.  Let  (3.23)  hold  end  let  hjj  be  the  solution  of  (3,24), 
setlefyine  (3,25).  Then,  ee  N -►  «, 


(3.26)  E^(Tj,|Z(N)  - n)  B^Tj, 
end 

(3.27)  verQ(Tj,lZ(N)  - fl)  -v*  V^. 

Proof.  Consider,  for  i « 1,  k,  J ■ 1,  •••,  N, 

Q(Z(N)  - n^} 

(3.28)  Q(ZjJ'  - 1|Z(N)  - n}  " Q{zCNr°lT  ’ 


where  n.  * (n^,  •••*  ^ ”** 

From  (3.17),  (3.20),  (3.21)  and  (3.23),  Z » lim  N”^  I Lj  kea 

elements  of  the  form  X^(l-Xj)  on  the  main  diagonal  end  off 

k k 

main  diagonal.  Glnce  I 1.  ■ 1,  det  I ■ n X. . Hence,  by  Lemma  3.3, 

i-1  ^ i-1 


■ exp{  (X  )/A  j “ l/2MXj^  ) 

■ exp(0(:c^/N))  , 
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I 


I' 

i; 


uniformly  lni«l,  Hence, 


(3.30)  E.(T,Jz(N)-n)  ■ X *.  I 

« jnl  ‘ - 


l|Z(N)-n)  <v  EjjT„. 


Similarly , fori,i'“l,  ...,k,J,J'»-l, 
(3.31)  Q(z|J^«l,  z[r^-l|z(N)«n)  % 


uniformly  in  i,  i',  J end  J'.  Tlie  Joint  prob-ibility  on  the  left-hand 
aide  of  (3.31)  is  zero  if  J ■ J'.  Hence 

W k 

(3.32)  E(^{Tj|2(N)-n)  - I I A®Q{z['J^-l|Z(W)-ri) 

+ llll  zjf  ^-1  |Z(N)-n}  <v  E-tJ. 

J»»J'  or  ij«i'  ^ ^ ^ ^ ^ " 

Lemno  3.6  followa. 


We  can  now  prove  Theorem  2.1.  Thie  will  be  done  by  eub- 
scitucing  Lemmaa  3. 2-3.6  In  Lemma  3.1.  More  epecifically,  let 
(3.23)  hold  and  let  h ■ hj^  eolve  (3.24)  which  entaila 

L 

h.,  -V  x„/N^ci,.  From  Lemmaa  3.2  and  3.3, 

N N A 


expl0(xjJ/N)l, 


using  (3.17).  From  Leons  3.4,  specifically  (3.16),  we  have 

(3.34)  I log(  I X.e  " J ^)  x?/2. 

J-1  i-1  ‘ 


Ey  the  definition  of  hj^, 


EqC(Tj,  - x^o^N**  - 2Vj)2|Z(N)-n3  - varp(Tj,|Z(N)-n)  vj. 


by  Lemma  3.6,  Hence,  fton  Lemma  3 >5, 


(3.35)  log  Ajj  'y>  -x^. 


Theorem  2.1  now  followa  by  subsCicutlng  (3.33),  (3.34),  and  (3.35) 
in  Lemma  3.1. 


Proof  of  Theorem  2.2 

The  main  difference  between  the  proofs  of  The.'rems  2.1  and 
2.2  lies  In  the  treatment  of  the  sum  A^,  defined  in  (3.9).  Hert^, 
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we  write  as  A^j  - exp[-hEqTj^]Cjj, 
where 

(3.36)  d*j,(y), 
where 

(3.37) 

(3.38) 

(3.39) 
and 

(3.40)  B - CXj^(XXj^N)‘*  - E^jTjj]/V. 

Observe  Chat  ^^|(*)  is  the  conditional  distribution  referred  to  In 
the  discussion  following  Theorem  2.2  (see  equation  (2.14)  }. 

Now  select  and  h.  Choose  to  be  the  solution  of 


■ Ji‘*i  ■ 

«j,(y)  - Q(T^  S yV  + E^Tj,  |z(N)-n}, 
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Thla  iB  the  of  (2.13).  Choose 

(3.42)  h - hy  - 

Noes  that  Is  wall-defined  by  (3.41)  and  we  have 
Pjj  'u  end  hj,  'v,  *j,/ (Aj^j,N)‘*,  so  these  choices  are  only  slightly 
different  from  those  in  the  proof  of  Theorem  2.1  ~ but  era  more 
convenient  for  the  proof  of  Theorem  2.2.  The  reasons  for  these 
choices  will  become  apparent  in  Lemma  3.7  and  display  (3.45). 
Lemmas  3.7  and  3.8  constitute  the  analysis  of  A^. 


1 3<Z«  (Hfijek  (196b)),  Let  (p, 41)  hold.  A necessary  and 
sufficient  condition  for  0,  where 

and  is  defined  in  (3.39)  i,e 


-25 


(3.43)  V*^  I (a.-a)S.if-  0 

leA^ 

c 

for  all  e > 0,  where  A ■ {l:|a.-a|  > cV>. 

G A 

Proof.  This  is  Theorem  7.1  of  H&Jek  (I96U). 

Lenroa  3.8.  Let  end he  ^ven  by  (3.41)  and  (3«42)i  reapeetively, 
and  let  (2.9)  obtain.  With  as  in  (3.36) • we  have»  as  N o>i 


Proof.  We  can  write 


(3.44)  “ / e d*(y) 

B 


“h„VB 


^ ] + hV  / e ^ [♦jj(y)-'l'(y)  ]dy. 

Techniques  alnilar  to  those  of  Lenina  3.4  yield  'v-  Consider 

N N 

(3.45)  B - V"^[xj^UjjrjjH)‘^Bq(y] 

• - pqh(M/Xjjyjj)**+  oKflh®)"’*] 

■ o(xr,^). 
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with  h ^lv«n  by  (3.A1)>  Henco 
(3.4*)  Cjj  ■ [l+o(l)J[(2nx^)”'*  + Ajj]. 

It  follows  from  (2.9)  that  is  smpty  for  large  Ni  hence  0 
by  Lemma  3.7.  Since  *,  Lemma  3.8  follows. 

To  complete  the  proof  of  Theorem  2.2.  apply  condition  (2.11) 
and  the  selections  of  pj^  and  h.  (3.42)  and  (3.43)»  to  Lemmas  3.1  - 
3.4  with  k»2  to  obtain 

(3.47)  PjjCXj,)  ^ Cj,  expC-xJ/2  + J(x^,  X^)]. 


Proof  of  Lennas  2.3  and  2.4 

Lemma  2.4  is  proved  by  expanding  the  logarithms. 

In  Lemma  2.3,  the  case  Xjj  ■ Ij  + 0(n”^)  is  proved  by  substi- 
tuting Pj,  ■ *1  + 0(H"^)  to  verify  that  it  is  a solution  of  (2.13). 
Otherwise,  recall  that 

(3.48)  Xjj  - p (p-WXjjTj^g/pqN  + o(N"^), 

using  (3.17)  and  (3.42).  Now,  suppose  x|j  ■ o(N)  and  propose 

(3.49)  p - Xjj  + Cj^x2/N  + o(n“^) 

an  a Aelutlon  of  (3.48).  Substitute  (3.49)  in  (3.48)  «nd  solve  for 


-27- 


Oj^  to  ottoln  ••  Xjj»  aa  In  (2.20) • Next,  ouppoae 

» o(N)  t\nd  try 

(3.50)  P - >.^y;c^(X^}x^j/II+u2xJj/n^+o(!r^) 

as  11  colnblou  of  (3.48).  Ar.ain,  iolve  for  c_  to  obtain  o-  = c.(x,,), 
an  in  (2. 21).  Hbo  hlfrhar  ottter  coefi'ii;ionto  can  bo  found 

oucoejulvi.’ly  l y coni  in.'in/f  this  Itai-ativt.  prucedu*’<:,  Thin  is  not 
done  here  bocau.u!  tl'o  alfjcbra  becoMaa  very  cmiberocu'c*  Tlio  proof  of 
Lenna  2.3  la  complete . 


Proof  of  Corollarlea  2.5  and  2.6 

Corollary  2.5  la  an  Immediate  consequence  of  Lemmas  2.3  and 
2.4,  and  Corollary  2.6  Is  proved  In  eseentlally  the  seme  manner  as 

Lemma  2.3. 
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